Search CORE

4 research outputs found

Deep learning models for predicting RNA degradation via dual crowdsourcing

Author: Amer Karim
Chiu King Yuen
Das Rhiju
Demkin Maggie
Fares Mohamed
Fujikawa Kazuki
Gao Jiayang
He Shujun
Ishi Keiichiro
Ito Takuya
Kim Do Soon
Kladwang Wipapat
Lee Youhan
Mao Hanfei
Nicol John J.
Noumi Taiga
Onodera Kazuki
Reade Walter
Romano Jonathan
Steenwinckel Bram
Tinti Michele
Tunguz Bojan
Vandewiele Gilles
Watkins Andrew M.
Wayment-Steele Hannah K.
Wellington-Oguri Roger
Öztürk Emin
Öztürk Fatih
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

Medicines based on messenger RNA (mRNA) hold immense potential, as evidenced by their rapid deployment as COVID-19 vaccines. However, worldwide distribution of mRNA molecules has been limited by their thermostability, which is fundamentally limited by the intrinsic instability of RNA molecules to a chemical degradation reaction called in-line hydrolysis. Predicting the degradation of an RNA molecule is a key task in designing more stable RNA-based therapeutics. Here, we describe a crowdsourced machine learning competition (‘Stanford OpenVaccine’) on Kaggle, involving single-nucleotide resolution measurements on 6,043 diverse 102–130-nucleotide RNA constructs that were themselves solicited through crowdsourcing on the RNA design platform Eterna. The entire experiment was completed in less than 6 months, and 41% of nucleotide-level predictions from the winning model were within experimental error of the ground truth measurement. Furthermore, these models generalized to blindly predicting orthogonal degradation data on much longer mRNA molecules (504–1,588 nucleotides) with improved accuracy compared with previously published models. These results indicate that such models can represent in-line hydrolysis with excellent accuracy, supporting their use for designing stabilized messenger RNAs. The integration of two crowdsourcing platforms, one for dataset creation and another for machine learning, may be fruitful for other urgent problems that demand scientific discovery on rapid timescales

PubMed Central

University of Dundee Online Publications

Deep learning models for predicting RNA degradation via dual crowdsourcing

Messenger RNA-based medicines hold immense potential, as evidenced by their rapid deployment as COVID-19 vaccines. However, worldwide distribution of mRNA molecules has been limited by their thermostability, which is fundamentally limited by the intrinsic instability of RNA molecules to a chemical degradation reaction called in-line hydrolysis. Predicting the degradation of an RNA molecule is a key task in designing more stable RNA-based therapeutics. Here, we describe a crowdsourced machine learning competition ("Stanford OpenVaccine") on Kaggle, involving single-nucleotide resolution measurements on 6043 102-130-nucleotide diverse RNA constructs that were themselves solicited through crowdsourcing on the RNA design platform Eterna. The entire experiment was completed in less than 6 months, and 41% of nucleotide-level predictions from the winning model were within experimental error of the ground truth measurement. Furthermore, these models generalized to blindly predicting orthogonal degradation data on much longer mRNA molecules (504-1588 nucleotides) with improved accuracy compared to previously published models. Top teams integrated natural language processing architectures and data augmentation techniques with predictions from previous dynamic programming models for RNA secondary structure. These results indicate that such models are capable of representing in-line hydrolysis with excellent accuracy, supporting their use for designing stabilized messenger RNAs. The integration of two crowdsourcing platforms, one for data set creation and another for machine learning, may be fruitful for other urgent problems that demand scientific discovery on rapid timescales

arXiv.org e-Print Archive

PubMed Central

University of Dundee Online Publications

Artificial intelligence for diagnosis and Gleason grading of prostate cancer: The PANDA challenge

Author: Allan Robert
Amin Mahul B.
Billoch-Lima Jorge
Brilhante Américo
Bulten Wouter
Cai Yuannan
Chai Tianrui
Chen Po-Hsuan Cameron
Corrado Greg S.
Dane Sohier
Delahunt Brett
Demkin Maggie
Egevad Lars
Eklund Martin
Evans Andrew J.
Farré Xavier
Geronatsiou Katerina
Grechka Dmitry
Grönberg Henrik
He Shujun
Hulsbergen-van de Kaa Christina
Humphrey Peter A.
Häkkinen Tomi
Ji Jianyi
Kartasalo Kimmo
Litjens Geert
Mermel Craig H.
Molinié Vincent
Nagpal Kunal
Ono Kosaku
Peng Lily
Pereira Emíio M.
Pereira Guilherme
Pinckaers Hans
Roussel Arnaud
Roy Paromita
Ruusuvuori Pekka
Saile Günter
Salles Paulo G. O.
Samaratunga Hemamali
Schaafsma Ewout
Shen Tao
Song Sejun
Steiner David F.
Ström Peter
Sun Qing
Tan Fraser
Tschui Joëlle
Tsuzuki Toyonori
Valkonen Masi
van Boven Hester
van der Kwast Theodorus
van der Laak Jeroen
Vink Robert
Weng Nina
Yamaguchi Taiki
Yoshihara Hiroshi
Zhou Kairong
Zhou Ming
Çakır Aslı
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

Through a community-driven competition, the PANDA challenge provides a curated diverse dataset and a catalog of models for prostate cancer pathology, and represents a blueprint for evaluating AI algorithms in digital pathology. Artificial intelligence (AI) has shown promise for diagnosing prostate cancer in biopsies. However, results have been limited to individual studies, lacking validation in multinational settings. Competitions have been shown to be accelerators for medical imaging innovations, but their impact is hindered by lack of reproducibility and independent validation. With this in mind, we organized the PANDA challenge-the largest histopathology competition to date, joined by 1,290 developers-to catalyze development of reproducible AI algorithms for Gleason grading using 10,616 digitized prostate biopsies. We validated that a diverse set of submitted algorithms reached pathologist-level performance on independent cross-continental cohorts, fully blinded to the algorithm developers. On United States and European external validation sets, the algorithms achieved agreements of 0.862 (quadratically weighted kappa, 95% confidence interval (CI), 0.840-0.884) and 0.868 (95% CI, 0.835-0.900) with expert uropathologists. Successful generalization across different patient populations, laboratories and reference standards, achieved by a variety of algorithmic approaches, warrants evaluating AI-based Gleason grading in prospective clinical trials.KWF Kankerbestrijding ; Netherlands Organization for Scientific Research (NWO) ; Swedish Research Council European Commission ; Swedish Cancer Society ; Swedish eScience Research Center ; Ake Wiberg Foundation ; Prostatacancerforbundet ; Academy of Finland ; Cancer Foundation Finland ; Google Incorporated ; MICCAI board challenge working group ; Verily Life Sciences ; EIT Health ; Karolinska Institutet ; MICCAI 2020 satellite event team ; ERAPerMe

İstanbul Medipol University Institutional Repository

Artificial intelligence for diagnosis and Gleason grading of prostate cancer : the PANDA challenge

Author: Allan Robert
Amin Mahul B.
Bulten Wouter
Cai Yuannan
Chen Po Hsuan Cameron
Corrado Greg S.
Dane Sohier
Delahunt Brett
Demkin Maggie
Egevad Lars
Eklund Martin
Evans Andrew J.
Grönberg Henrik
Hulsbergen-van de Kaa Christina
Humphrey Peter A.
Häkkinen Tomi
Kartasalo Kimmo
Litjens Geert
Mermel Craig H.
Nagpal Kunal
Peng Lily
Pinckaers Hans
Ruusuvuori Pekka
Samaratunga Hemamali
Steiner David F.
Ström Peter
Tan Fraser
the PANDA challenge consortium
Tsuzuki Toyonori
Valkonen Masi
van Boven Hester
van der Kwast Theodorus
van der Laak Jeroen
Vink Robert
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2022
Field of study

Artificial intelligence (AI) has shown promise for diagnosing prostate cancer in biopsies. However, results have been limited to individual studies, lacking validation in multinational settings. Competitions have been shown to be accelerators for medical imaging innovations, but their impact is hindered by lack of reproducibility and independent validation. With this in mind, we organized the PANDA challenge—the largest histopathology competition to date, joined by 1,290 developers—to catalyze development of reproducible AI algorithms for Gleason grading using 10,616 digitized prostate biopsies. We validated that a diverse set of submitted algorithms reached pathologist-level performance on independent cross-continental cohorts, fully blinded to the algorithm developers. On United States and European external validation sets, the algorithms achieved agreements of 0.862 (quadratically weighted κ, 95% confidence interval (CI), 0.840–0.884) and 0.868 (95% CI, 0.835–0.900) with expert uropathologists. Successful generalization across different patient populations, laboratories and reference standards, achieved by a variety of algorithmic approaches, warrants evaluating AI-based Gleason grading in prospective clinical trials.publishedVersionPeer reviewe

Publikationer från Linköpings universitet

PubMed Central

Digitala Vetenskapliga Arkivet - Academic Archive On-line

Trepo - Institutional Repository of Tampere University